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@ Variant proteins and polypeptides having an enhanced affinity, i.e. greater binding strength, for immobilized- 
metal affinity resins have a histidine or aspartic acid residue at each of one or more positions which, in the 
natural protein, are occupied by other amino acid residues, such that the variant protein contains a metal- 
chelating amino acid sequence. The specific sequence depends on the secondary structure associated with the 
portion of the protein or polypeptide which includes such sequence. 
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VARIANT PROTEINS AND POLYPEPTIDES POSSESSING ENHANCED AFFINITY FOR IMMOBILIZED-METAL 

AFFINITY MATRICES 



BACKGROUND OF THE INVENTION 



5 Field of the Invention 



The present invention relates to variant proteins and polypeptides which manifest enhanced affinity for 
inrimobilized-metal affinity matrices. More particularly, the present invention relates to proteins and polypep- 
tides having at least one metal-chelating amino acid sequence engineered thereinto. 

10 

Prior Art 

When recombinant proteins or polypeptides are expressed in bacterial cells and subsequently grown in 

15 fermentation tanks, there remains the problem of recovering the desired product in pure form. Immobilized- 
metal affinity chromatography or ligand exchange chromatography is a well known technique for amino acid 
and protein purification. F. Helfferich, Nature 189, p. 1001 (1961); H. F. Walton et al. Recent Developments 
in Separation Science. Vol. Vf, Chapter 5 1981. Porath et al, Nature (London), 258, p. 598 (1975): 
Biochemistry, 22, p. 1625 (1975) and Sulkowski, Trends in Biotechnology, 3 , p. 1 (1985), have shown that 

20 this technique is well suited for selective fractionation of native proteins according to their content of 
exposed histidine residues. The chelating ligands such as iminodiacetic acid were bound covalently to 
oxirane-activated agarose, and the resulting gels were charged with metal ions such as Cu^* Zn^* Ni^"" or 
Co2 Such resins have since been utilized for purification of several native peptides and proteins. See. for 
example, Nilsson et al. Embo J. 4 , p. 1075 (1985). 

25 It was desirable, however, That a method be developed for purifying all recombinant proteins and 
polypeptides, not just those which inherently contain one or more exposed histidine side chains or residues. 
To this end, hybrid proteins were developed wherein the coding sequence of a protein of interest was fused 
with the coding sequence of a small histidine-containing peptide. The coding sequence of the affinity 
peptide is fused to the protein of interest along with the sequence of a specific cleavage site. Such fusion 

30 proteins can then be purified by taking advantage of the binding of the histidine-containing peptide, or 
affinity tail, to the affinity matrix. After purification of the fusion protein, the affinity tail is split off at the 
designed cleavage site, which is a very difficult and expensive procedure, and the protein of interest is 
purified in a final step. See, for example, Smith et al Biol. Chem. 263, 721 1 (1988). In general this technique 
is not suited for separating the desired protein from other proteins which inherently include one or more 

35 metal-binding amino acids, particularly since many protein impurities are likely to contain multiple exposed 
histidines. However, if the affinity tail contains 6-8 or more histidines, this method can be useful. See 
Hochuli et at, Biotechnology, 6. 1321 (1988). 

Thus, it would be desirable to have a method for protein and polypeptide purification which is 
applicable to all recombinant proteins and polypeptides and which is effective in separating the desired 

40 protein from other proteins which inherently include one or more metal-binding amino acids. 

It has now been discovered that the binding of a protein or polypeptide to an immobilized-metat affinity 
matrix can be enhanced to an extent which is significantly greater than that of native proteins and 
polypeptides which inherently include one or more metal-binding amino acids as well as fusion proteins and 
polypeptides. Such enhanced affinity is effected by engineering into such protein or polypeptide a metal- 

45 chelating amino acid sequence within the primary sequence thereof and at a specific segment of the 
secondary structure. 



SUMMARY OF THE INVENTION 

50 

The present invention is directed at variant proteins and polypeptides having an enhanced affinity, i.e., 
greater binding strength, for metal affinity matrices. The present invention resides in engineering, preferably 
utilizing recombinant DNA methodologies, one or more specific metal-chelating amino acids into a protein 
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or polypeptide, the specific sequence depending on the particular metal-binding amino acids contained in 
the sequence and the secondary structure associated with the portion of the protein or polypeptide which 
will include such sequence, 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the elution profile of a crude refold mixture of A-1H8H12 insulin-like growth factor-1 
(IGF1) modified according to the teachings of the present invention. 

Figure 2 illustrates the elution profile of a mixture of folded species of A-1H8H12 insulin-like growth factor 
-1 (IGF1) modified according to the teachings of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 



As utilized herein, the term "metal-chelating amino acid sequence" means a sequence of amino acids 
which includes at least two metat-binding amino acids stereochemically arranged such that a simultaneous 
two-site attachment may be formed with an immobilized metal. Where a sequence of amino acids includes 
a metal-binding amino acid, such amino acid will form a coordination bond with the immobilized metal of 
the immobilized-metal affinity matrix, thereby resulting in a ternary "metal complex" (immobilizing ligand + 
metal + peptide). However, if such sequence includes two or more metal-binding amino acids which are 
properly oriented, simultaneous coordination with the transition metal by both amino acids results in a 
"metal chelate". Such chelation will result only when proper stereochemical factors and conformational 
constraints lead to favorable enthalphy and entropy effects. The present invention provides variant proteins 
and polypeptides which contain a metal-chelating amino acid sequence and, therefore, manifest enhanced 
affinity for resins. 

The metal-chelating sequence can be represented by the formula: -A-B^-Cy-Dz E wherein A and E are 
independently metal-binding amino acids selected from the group consisting of histidine and aspartate 
provided that at least one of either A or E is histidine. B. C and D are amino acids, and x, y and 2 are 
integers from 0 to 3 depending on the secondary structure of the surface exposed portion of the protein or 
polypeptide molecule which includes the metal-chelating or sequence-containing site and the particular 
metal-binding amino acids utilized in the metal chelating sequence. The sum of x. y and z in combination 
with the secondary structure of the sequence-containing site determines whether the stereochemical 
requirements are met in order for A and E to simultaneously bind to the immobilized metal and thus form a 
chelate. Where the metal-binding amino acids are independently histidine or aspartate, and the metal- 
chelating sequence is to be engineered into an a-helical segment of the protein or polypeptide, x + y + z 
Is equal to 3. Where such segment is a ;3-hairpin turn, x + y + 2 is equal to 2 and where such segment is 
a ;S-strand, x + y + z is equal to 1. 

Where the native protein or polypeptide lacks an accessible or available metal-binding amino acid at 
the proper position, the entire sequence must be engineered into such protein or polypeptide at such 
position. However, where the protein or polypeptide includes an accessible metal-binding amino acid at the 
proper position, e.g.. on an a-helix, only one additional metal-binding amino acid is required to be 
engineered into such protein or polypeptide in order to effect the proper sequence. 

The proper position for placing the metal-binding amino acids is readily determined for those proteins 
and polypeptides with known structures. Where the structure has not been determined, the primary 
sequence can be determined by well known methods and the secondary structure can be usefully predicted 
as demonstrated below, utilizing methods well known in the art. 

Proper positioning may be determined, for example, as defined below. Calculations are based on the 
following facts for a desired metal chelate: 1) that the donor atoms in the protein llgands are matched to the 
particular metal ion or metal complex; 2) that the two or more metal-binding atoms easily satisfy the specific 
geometric requirements of the metal; 3) that the chelating form of the protein ligand is conformationally 
constrained (relatively inflexible or rigid). 

Of the natural amino acids, only the side chains of cysteine, histidine. aspartate and glutamate have 
significant binding strength in aqueous solutions for divalent first row transition metals at neutral pH. Thus, 
cys > his » asp, glu > other amino acids 

For a cis disposition of llgands binding to Cu'^* (similarly for Ni^ . VO^ and Zn^ ). X-ray crystallog- 
raphic data for metal complexes show that typical copper-nitrogen bonding parameters are Cu-N = 1.98- 
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2.02 A and N-Cu-N = 80° -100°. X-ray crystallographic data for proteins show three commonly observed 
secondary structural features: a-helices, jS-strands and ^-hairpin turns. These structured regions at least 
partially fulfill the requirement of conformational constraint. Typical conformational values for a-helices (0 = 
-57*, xl^^ - -47\ o) = 180'), j9-strands (0 = -139\ = +135°. a> =180*) and j8-hairpin turns (Type T. 

5 Type II ) were used. Geometric searches of energetically acceptable sidechain conformations for his and 
asp residues were carried out to find which amino acid sequences coupled with corresponding secondary 
structures could provide a bidentate chelating site for Cu^* with the above distance and angular constraints. 
Only short range chelating intereactions were considered; i.e., the number of intervening residues between 
the binding residues was 0 to 4. The results of the calculations are shown in the following table; ( + ) shows 

10 when chelation can occur and (-) shows when chelation cannot occur. The nature of the intervening residues 
("X") is relatively unimportant; the modeling demonstrated that the steric size, the hydropathicity and the 
charge of the sidechains of these residues play only minor or secondary roles in determining the strength of 
the metal-chelating peptide interactions. 
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Once regions of regular secondary structure have been identified, it is necessary to determine which 
residues in these regions are sufficiently exposed on the surface of the protein so that they could easily 
bind to immobilized metals. The periodicity of the hydropathy over the region of interest was used as a 
guide in finding the exposed residues. Although a number of hydropathy scales have been defined, the one 
most useful in the present application is the scaie based on the degree to which a particular amino acid 
residue is buried or exposed based upon proteins whose structures are determined by X-ray crystallog- 
raphy, see (Kideva et al, J. Protein Chem., 4, 23, (1985); Table III, property 4. 

Of- Helix For the entire helical region of Interest, the hydropathic moment (direction and magnitude) was 
calculated using a pitch of 18 residues per 5 turns (100 '/residue). If the hydropathic moment was 
sufficiently large (>|0.3|), the residues were then classified into three equally populated categories; exposed, 
buried and borderline. 

jg-Strand For the entire ^-strand region of interest, the hydropathic moment was calculated using a pitch 
of 2 residues per turn (1 80 ' /residue). This is an easy calculation for ;Q-structures because each residue is 
either "up" or "down". In this case, the residues were classified into two equally populated categories: 
exposed and buried. 

j3-Hairpin Turn Because these two-residue turns are supersecondary structures, there is little need for 
further calculations once the residues in the turn have been identified. The residues suited for metal 
chelation are the two residues on either side of the hairpin turn. These turns occur most frequently on 
exposed surfaces of proteins with the turn residues and adjacent residues exposed. 

Having determined .the proper position for placing the metal-binding amino acids, the variant proteins 
and polypeptides may be prepared according to techniques well-known to those skilled in the art. Such 
variants may include a single metal-chelating site as well as a plurality of such sites depending on the 
desired affinity for a particular immobilized metal resin. 

The variant proteins and polypeptides of this invention may be prepared by chemical synthesis. 

4 
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However, because proteins and polypeptides possessing secondary structures are generally large mol- 
ecules, it is preferred to prepare thenn by recombinant DNA technology. This can be done utilizing 
conventional means to construct a gene encoding the desired variant protein or polypeptide having the 
desired metal-chelating sequence in the proper position. A convenient method of constructing the variant 
proteins and polypeptides is by conventional ollgonucleotide-directed site-specific mutagenesis of the 
starting gene. The mutated gene is then cloned into an appropriate vector which vector subsequently is 
utilized to transform a suitable expression host, such as bacteria (e.g., E. coli or Pseudomonas), yeast (e.g., 
S, cerevisae), or mammalian cells (e.g., C127 or CHO). The variant protein or polypeptide is then expressed 
in a conventional manner and recovered as described below. 

The variant proteins and polypeptides thus produced will typically have only one to two modifications in 
the primary sequence. Such small modifications are expected to, and typically do, result in a variant protein 
or polypeptide which retains the biological properties of the native protein or polypeptide. Furthermore, such 
modifications are not expected to and do not typically affect the antigenic characteristics thereof. 



DESCRIPTION OF PREFERRED EMBODIMENTS 



EXAMPLE 1 



VARIANT SOMATOTROPIN 



This example illustrates the present invention as applied to a protein, namely, somatotropin which has 
two available metal-binding amino acids, wherein at least one additional metal-binding amino acid is 
engineered thereinto to effect a metal-chelating sequence. This example also illustrates application of the 
present invention to a protein wherein the structure is known. 

Somatotropins are naturally occurring proteins found in animals and, because of their effect on animal 
growth are commonly referred to as "growth hormones". Native bovine and porcine somatotropins include 
histidine residues at positions 19, 21 and 169. Hisig and HiSisa are both on a-helial segments and are 
exposed. His2i is buried. Metal-binding amino acids were engineered into bovine and porcine 
somatotropins, according to the following procedure, to produce variant somatotropins having a metal- 
chelating site. Enhanced affinity for immobilized-metal affinity matrices is shown in Table 1 for the variants 
produced according to this invention. 



Variant Genes 



The residues at the positions indicated in Table 1 of the BGH and PGH structural genes described in 
Seeburg et aL, DNA, 2, p. 37, (1983), are changed by oligonucleotide-directed, site-specific mutagenesis. 
Oligonucleotide mutagenesis primers are synthesized on an Applied Biosystems DNA Synthesizer in 
accordance with the procedures set forth by the manufacturer. Applied Biosystems, Inc. (Foster City. OA). 
The sequence of the mutagenesis primers are: 
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15 



BST 


TTTGAAGGTGTCA TG AGCCAGCTGAT 


BST 


TGCCCAQC AT GGGGGGTG 


BST 


AAGATGCTGAGCA T GAAGAACAGCG 


BST 


GTCGTCACT AT GCATGTTTGTG 


ABST 


TGCCCAGC AT GGGGGGTG 


ABST 


GTCGTCACT AT GCATGTTTGTG 


BST 


CTGAGCACGA T GAACAGCGT 


BST 


GTTGGTGAAGAC ATQ GCTGAGGAACTG 


ABST 


CTGAGCACGA T GAACAGCGT 


ABST 


CTGAGCACGA T GAACAGCGT 


PST 


GCTAACGCTGttC A TCGGGCCCAGCAC 


ABST 


ATGAACAGCGTTA TG GAATAGACCAGA 


ABST 


ACCGTAGTT G T G'GAGCAGCGCGT G GTCACTGCG 


ABST 


GGTGCGCTCAAA A T G TTTGAAGGTGT G 
AGCAGCCAGCTG ~ 


ABST 


ATGCAGGTCCTT GT GGAAGCAGGAGA 


ABST 


GACCCTCAGGTA GTG CTCCGTCTTATG 


PST 


AAGAAGGACCTG AG CAAGGCTGAGACA 



20 

The underlining indicates the codon which changes the native residue to the desired residue. 

The BGH gene used as template DNA consisted of the BGH gene described in Seeberg et al cloned 

into the M13mp18 vector (Bethesda Research Laboratory, Gaithersburg, MD) as a EcoRI/Hindtli fragment. 
25 Also used as a template is the N-alanyl, valine (126) BGH gene described in European Patent Application 

No. 193,515, published September 3, 1986, cloned into the M13mp18 vector as an EcoRI/Hindll! fragment. 

The PGH gene used as template DNA as the N-alanyl PGH gene described in European Patent Application 

No. 193,515 cloned into the M13mp19 vector (BRL) as an EcoRI/Hindlll fragment. Before the mutagenesis 

of the PGH gene at the desired position, the 5 end of the gene was mutagenized to create an Ncol site in 
30 order to facilitate later subcloning into the expression piasmid. The primer used for this mutagenesis was 

synthesized as those above and has the structure of 

5'-CAGTGAATTCTCCATGGCCTTCCCAGC-3' 

The mutagenesis procedure for the Ncol site addition is described by Kunkel. (Proc. Natl, Acad. Sci., 82, p. 
422 [1985]). All restriction enzymes and modifying enzymes (T4 DNA Itgase and polynucleotide kinase) are 

35 purchased from New England Biolabs (Beverly, MA) and used according to the manufacturer's directions. 

The mutagenesis is carried out using the Amersham (Arlington Heights, IL) Oligonucleotide-directed in 
vitro Mutagenesis System, according to instructions of the manufacturer. Following mutagenesis, positive 
mutant genes are identified by DNA sequence analysis using the SequenaseTM DNA sequencing system of 
United States Biochemical Corporation (Cleveland, Ohio) according to the manufacturer's instructions. The 
mutated genes are then cloned as EcoR!/Hindlll fragments into E. coli expression vector pMON2534. 
Plasmid pMON2534 is a derivative of pBGHgx-i (Seeburg et al) with a tandem lacUV5 promoter inserted at 
the Hindlll site of pBGHex-i as a transcription terminator. The sequence of the tandem lacUV5 promoter as 
an EcoRI fragment is described in Bogoslan et al, (Nucleic Acid Research, 15, 7185 [1987]). The EcoRI 
fragment is converted to a Hindlll fragment by filling in the EcoRI overhangs and attaching Hindlll linkers. 

^5 These manipulations yield the sequences which are. found at the ends of the Hindlll fragment. At the 
upstream end, the Hindlll linker (AAGCTT) ligated to the filled-in EcoRI end (AATTCT...) produces the 
sequence AAGCTT AATTCT...; the CT at positions 11 and 12 represents the right half of the Alul site from 
the original lacUV5 promoter fragment- At the downstream end, the squence produced is 
..AGAATT AAGCTT; the AG at positions -12 and -11 represents the left half of the Alul site from the original 

^0 lacUV5 promoter fragment In addition, pMON2534 has the EcoRI site at the 5 terminus of the ptrp 
fragment of pBGHgx-i and the Hindlll site at the 3' terminus of the tandem lacUV5 promoter/operator 
fragment removed by digestion of the overhand EcoRI and Hindlll ends using SI nuclease and blunt-ended 
ligation with T^ DNA ligase. Plasmid pMON5585 is a pBR327 plasmid containing E. coli recA promoter, a 
G10L sequence, and T? transcription termination sequence as described in European Patent Application 

55 Number 241. 446, published October 14, 1987. The mutant BGH and PGH genes cloned into the 
expression plasmids are inserted into E, coli strain W3110 (ATCC ^39936). 



6 
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Cell Lysis and Isolation of inclusion Bodies 

After thawing an appropriate amount of frozen cell paste at 5* C. 120 g of the cell paste was carefully 
suspended in 480 mL cold water using an Ultra Turrax stirrer. The chilled cell suspension was passed 4 

5 times through a pre-cooled Manton Gaulin homogenizer set at 420-560 Kg/sq. cm. pressure. The resulting 
suspension of lysed cells was subjected to uitracentrifugation at 50,000 Xg (25,000 rpm in 45TI rotor) for 35 
minutes using a Beckman Model L8 centrifuge. The clear supernatant liquid was poured off, and the 
remaining brownish pellet was vigorously washed with a small stream of water in order to remove the top 
slimy layer of unwanted cell debris. The pellet was resuspended in water and subjected a second time to 

w uitracentrifugation and washing. The remaining material was mechanically scraped^ out of the centrifuge 
tubes and combined to yield 4.7 g of damp inclusion bodies which were stored at -80* C for future use. 



Oxidation and Folding of Somatotropin 

15 

A 4.0 g mass of Inclusion bodies was suspended in 300 mL cold water using an Ultra Turrax stirrer. The 
volume of the suspension was increased to 375 mL by the addition of more water. Then 425 mL cold, 
freshly prepared, deionized urea solution (7.5 M) was added to the suspension to yield a mixture about 4M 
in urea. With good stirring, the pH was adjusted to 11.3 by the dropwise addition of 2.5 M NaOH solution. 

20 During the NaOH addition, most of the suspended inclusion bodies dissolved, yielding a light yellow 
solution. This solution was vigorously stirred in an open container at 5*C for 48-72 hours in order to effect 
the refolding of the desired protein. On several occasions cysteine (9.7 mg, 0,1 mM) was added to the 
mixture which shortened the refolding time by about half. In order to remove residual Insolubles. the 
refolded mixture was subjected to uitracentrifugation at 50.000 xg (25,000 rpm in Beckman 45TI rotor). The 

25 clear yellow supernatant liquid was decanted off and then prepared for purification or stored at -20 C. A 2 
ml sample of the crude refold mixture was tested on a small copper-loaded metal-affinity column to 
determine if the binding was strong enough to be useful. If so, a preparative metal-affinity column was used 
for purification; otherwise, ion-exchange chromatography was used. 

30 

Preparation of Functionalized Resin 

Trisacryl GF2000M was repeatedly washed with distilled water to remove all buffers and preservatives 
and then dried by suction. About 100 g (-100 mL) of this slightly damp matrix was suspended in a solution 

35 containing 80 mL diglyme and 100 mL freshly prepared 1.4 M NaOH solution. ^Finally. 100 mL 
diethyieneglycol diglycldylether was added, and the mixture was gently stirred at 35 'C for 16 hours. 
Purified diethyieneglycol diglycldylether was prepared by the literature method of Gu, Ideda and Okahara 
(Synthesis, 649, [1985]). The activated matrix was washed with diglyme/H20 (50/50) and then repeatedly 
washed with distilled water to remove the excess epoxide and base. The washed, suction-dried matrix was 

40 shown to have 70 micromoles active epoxide groups per mL resin. The activated resin was stored at 4 C 
and generally used within 24 hours of preparation. 

Immobilization of Chelate 

45 

The activated Triascryl was washed with distilled water and dried by suction. About 100 g (-100 mL) of 
this activated gel was suspended in 100 mL 1.0 M Na2NH(CH2CO)2 solution, which was adjusted to pH = 
10.5-11.0. This mixture was gently stirred at 65*0 for 24 hours and then repeatedly washed with distilled 
water to remove excess ligand. The functionalized resin was stored in ethanol/water (25/75 v/v) at 4 C until 

50 ready for use. Titration with thiosulfate showed the absence of epoxide groups, so capping with 
ethanolamine was deemed unnecessary. Ten milliliters of suction-dried gel was saturated with excess 50 
mM Cu(C104)2 and then carefully washed with 100 mL distilled water, 100 mL 50 mM imidazole (pH = 7.0) 
and finally 100 mL H2O. Then the bound copper was removed with an excess of 50 mM Na2H2EDTA 
(pH=7.0). Using standardized copper-EDTA solutions for comparison, the total copper content was 

55 photometrically determined to be 0.43 mitlimoles. 



Elution Protocol for Metal-Affinity Column 
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A glass column (2.2 x 21 cm, 80 mL) was packed with carefully washed IDA-Trisacryl gel and then 
charged with 400 mL 50 mM Cu(C104)2 (pH = 4.5). The functionalized gel nearly quantitatively absorbed 
the copper Ions. This copper column was washed with 100 mM NaCI, and then it was equilibrated with the 
release buffer (100 mM N„-acetylhistidine, 500 mM NaCI, 50 mM NaH2P04, pH = 7.0) and finally 

5 equilibrated with the loading buffer (1 mM N„-acetylhistidine, 500 mM NaCI, 50 mM NaHaPO*. pH = 7.0). 
The filtered crude refold mixture (-800 mL) was pumped onto the column at 5 mL/min and then the column 
was washed with 240 mL of the loading buffer. The column was developed for 500 minutes at a flow rate of 
2.5 mL/min at ambient temperature (23*) using a linear gradient of N„-acetylhistidine (1->100 mM). The 
eluate from the column was continuously monitored at 280 nm (0.2-2.0 AUFS) using a Kratos Model 757 

10 spectrometer equipped with a 3 mm path-length cell. Fractions (25 mL) were collected using a Gilson 
Model 202 fraction collector. After the run was completed, the column was stripped with 50 mM 
Na2H2EDTA (pH = 7.0) and then 50 mM NaOH. The column was ready for regeneration after washing with 
100 mM NaCI (240 mL), loading buffer (240 mL) and finally 100 mM NaCI (240 mL). The fractions 
containing the somatotropin were collected and combined to yield 250 mL solution which contained 1.7 

75 mg/mL protein. Typical analyses by analytical reverse phase HPLC (Vydac C18 column, H2O/CH3CN + 
0.1% CF3CO2H) of Hisi5-avbST at this stage showed the following (column A). 



Protein 


A 


B 


Somatotropin monomer + isoforms 
Somatotropin-related oligomer 
Extraneous protein 


96.6% 
2.5% 
0.9% 


98.2% 
1 ,5% 
0.3% 



Sacrificing the back 15-20% of the somatotropin peak reduced the oligomer content to 1.0-1.5%. Con- 
centrating the purified somatotropin and reloading it onto the same column (cleaned and regenerated) 
yielded slightly purer product (column B above) with 94% recovery. 

Variations on this elution protocol were used, in our early experiments. Pharmacia Chelating Sepharose 
68 was used; this gel was difficult to clean and flow rates were restricted. For larger column columns (0.5- 
2.0 L), we also used Pharmacia Chelating Sepharose Fast Flow. Although the chromatographic resolution 
was reduced somewhat, we sometimes found it convenient to use imidazole (0.5 ">45 mM) as the release 
buffer or to use 1.0M NaCI instead of 0.5M NaCI. The stronger binding variants, HisaeHisso-avbST and 
Hisi 1 Hisi5-avbST, were more easily purified using imidazole (100 mM) as the release agent. 

35 

Ultrafiltration, Concentration and Lyophtlization 

The purified somatotropin solution (250-500 mL) was concentrated to 30-40 mL in volume using a 400 
mL stirred Amicon ultrafiltration cell equipped with a YM10 membrane. If the metal-affinity column was 
used, 40 mg solid Na2H2EDTA '2^12 0 was dissolved in the protein pool prior to concentration. Using 5mM 
Na2C03 (pH = 10.0), the protein solution was diluted to 400 mL in volume and then reconcentrated to 30- 
40 mL under N2 pressure. Three more cycles of dilution and reconcentration were carried out, yielding 
about 30 mL of purified somatotropin in carbonate buffer. The liquid was placed in a lyophilization flask 
along with enough water to increase the volume to 100 mL. The solution was frozen and placed on a Virtis 
Lyophilizer (Freezemobile 12) overnight. The resulting fluffy white solid was weighed and stored in a sealed 
container at -20 * C. 



50 
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This example illustrates application of the present invention to a protein, namely, somatomedin C, which 
does not contain an available metal-binding amino acid, wherein two metal-binding amino acids are 
5 ' engineered thereinto to effect a metai-cheiating sequence. This example also illustrates application of the 
present invention to a protein wherein the 3-dimensional structure has not been determined. 

Somatomedin C, or "insulin-like growth factor-1" as utilized herein refers to IGFI. It is contemplated 
that IGF2, as well as proinsulin and insulin, can be modified in substantially the same manner with 
substantially similar results. For example, Alas and Aspi2. which are predicted according to the procedures 
10 set forth above to be located at an a-helical segment of IGF1. were both replaced with histidine according 
to the procedure set forth below. Enhanced affinity for the HissHisia-IGFI variant Is shown in Table 2. The 
residue at position 16, rather than at position 8, can also be replaced with a histidine since the a-helical 
segment is predicted to extend to position 1 7. 

75 

Choice of Bacterial Strains and Starting Plasmids 

Strains used were JM101 (supE. thi, (lac-proAB), [f', traD36, proAB, laclqZ M15]) (C. Yanisch-Perron, J. 
Vieira, J. Messing, Gene, 33, 103. [1985]) and BW313 (dut, ung, thi-1, relA, spoTI/p' lysA). (T, A. Kunkel. 

20 Proc. Natl. Acad. Sci.. 82, 488, [1985]) Plasmid pMON2464 consists of the replicon of pBR327 (L 
Covarrubias, L. Cervantes. A. Covar-rubias, X. Soberon, A. Blanco, Y. M. Kupersztoch-Portnoy. F. Bolivar, 
Gene. 13, 25, [1981]) into which an expression cassette has been inserted in place of a portion of the 
tetracycline gene. The promoter used is derived from the rec A gene of E. coli; the ribo-some binding site 
used is from gene 10 of phage T7; (P. O. Olins, C. S. Devine, S, H. Rangwala, K. S. Kavka. Gene. 73, 227, 

25 [1988]) the gene encodes alanyl-IGFI with histidine substitutions at positions 8 and 12. Downstream from 
the gene is a sequence of about 500 base pairs from pEMBLIS. (L Dente, C. Cesareni, R. Cortese, Nucleic 
Acids Res. , 11 , [1983]). This sequence contains the origin of replication of the single stranded phage f 1 . In 
cells infected with phage R40a, single stranded plasmid DNA is packaged into phage particles. In 
pMON2464. the EcoRI site and the PstI site in the sequence of the beta-lactamase gene have been 

30 eliminated by in vitro procedures. 



Construction of Plasmids 



35 Alanyl-IGF1 was found to be produced from pMON2446 at a level of about 10 percent of total cell 
protein. The protein could be recovered easily from insoluble inclusion bodies and could be refolded into its 
active conformation. The production of 1GF1 variants containing metal binding sites was made by 
constructing alanyi-IGFI variants which differed in coding sequence from pMON2446 only in the codons 
required to specify the amino acid changes. 

40 Method A . The construction of pMON2464 which encodes alanyl-IGFI with histidine replacements at 
position 8 an^ 12 is now described. The DNA between the Ncoi and PstI sites in pMON2363 was replaced 
with complementary synthetic oligomers which encode the N terminal 16 codons of the alanyi-IGFI gene. 
The DNA of pMON2363 was used because it contains the gene for an IGFI variant which has nine more 
bases of DNA between the Ncol and PstI sites than does pMON2446. Substitution of the DNA of 

45 pMON2363 between the Ncoi and PstI sites with synthetic DNA which is of shorter length permitted the 
identification of recombinant plasmids which encode the desired alanyl-IGFI variants. One microgram of the 
DNA of pMON2363 was treated with the restriction enzymes Ncol and PstI at 37'*C for at least 2 hours in 
the following buffer: 10 mM Tris'HCi (pH 7.5), 5 mM MgCb, 150 mM NaCI. To the restriction enzyme 
reaction mixture NaOAc was added to a final concentration of 300 mM in a final volume of 0.3 mL. The 

50 sample was extracted with 0.1 ml each of water saturated phenol and of chloroform. The aqueous phase 
was removed, and 0.7 mL 95% ethanol was added to it and mixed. 

Synthetic oligonucleotides were produced by phophodiester chemistry using an Applied Biosystems 
DNA synthesizer. The sequences of these oligonucleotides is shown below: 

55 #1 5' CATGGCAGGACCAGAAACTCTTTGCGGCCATGAACTTGTTCATGCTCTGCA 3' 

#2 3' CGTCCTGGTCTTTGAGAAACGCCGGTACTTGAACAAGTACGAG 5' 

12 
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These oligonucleotides were passed through a duPont Nensorb column to remove salts. Purification of 
the oligonucleotides from a polyacrylamide gel was not necessary. Approximately 1000 picomoles of the 
oligonucleotides were resuspended in water. One hundred picomoles of each of the complementary 
oligonucleotides were mixed in a volume of 50 microliters in the following buffer: 6.6 mM Tris (pH 7.4), 6.6 

5 mM MgCl2. 6.6 mM NaCI and 5 mM dithiothreitol. The sample was placed in boiling water which was 
allowed to cool to room temperature to permit annealing of the oligonucleotides. Ten picomoles of the 
annealed oligonucleotide mixture was added to an aliquot of one half of the Ncol and PstI treated 
pMON2363 DNA in ethanol. Both this sample and the other half of the pMON2363 DNA were chilled, and 
the DNA was collected by precipitation. The dried pellets were resuspended in 20 microliters of ligation 

10 buffer: 25 mM Tris (pH 8.0). 10 mM MgCb. 0.2 mM spermidine. 1 mM dithiothreitol and 1 mM ATP. To this, 
10 units of T4 DNA ligase was added, and the reaction mixture was incubated overnight at 15 C. 

Method B. Single stranded DNA of pM0N2464 was isolated. A culture of E. coli strain BW313 harboring 
pMON2464 was grown in 2XYT medium (16 grams tryptone. 10 grams yeast extract, 5 grams NaCI per 
titer) with the addition of 200 micrograms per milliliter of ampicillin. At a Klett value of 110, phage R408 

15 (Stratagene) was added to a final concentration of 10(9) phage per milliliter. At the same time uridine was 
added to a final concentration of 0.25 microgram per mL. The culture was allowed to grow with shaking at 
37' C overnight. Four to six milliliters of culture were subjected to centrifugation to remove cells. To the 
supernatants were added one fourth volume of phage precipitation buffer (2.5 M NaCi. 10% w/v polyethyl- 
ene glycol M6000, 0.15 mM EDTA (pH 7.0), 10 micrograms per milliliter pancreatic RNase). These samples 

20 were kept at 4' C overnight. The phage from one milliliter of culture supernatant were collected by 
centrifugation and resuspended in 50 microliters protease K digestion buffer (10 mM Tris HCI pH 7.4. 0.1 
mM EDTA, 0.2% sarkosyl and 0.05 mg/mL protease K). The samples were incubated at 65* C for one hour 
and then chilled on ice. NaCI was added to a final concentration of 400 mM. The samples were .yortexed in 
the presence of one half volume each of water saturated phenol and chloroform. The aqueous'phase was 

25 removed and the nucleic acid was precipitated with two volumes of cold ethanol. The dried pellets were r. 
resuspended in 10 microliters of water per four milliliters of original culture superna-tant. ^ 

The single stranded DNA was derived predominantly from the plasmid rather than from. the phage - 
R408. A low level of uracil incorporation results from the growth of the plasmid in strain BW313. This 
permitted a selection in favor of an in vitro synthesized complementary strand which contains no uracil. To 

30 prime synthesis of this strand. a~s7nthetic DNA oligonucleotide was used. The sequence of this 
oligonucleotide (5'GCAAACGTGCTGCAGAGCATGAACAAG 3') differs from the complement of the se- 
quence of the single stranded template at the position of codon 16 of the 1GF1 gene. The sequence of the 
oligonucleotide specifies histidine, whereas the template specifies phenylalanine at that position. 

Fifty picomoles of this oligonucleotide was treated with polynucleotide kinase in the presence of 25 mM 

35 Tris (pH 8.0). 10 mM MgCb, 0.2 mM spermidine, 1 mM dithiothreitol and 1 mM ATP for 30 minutes at 
37' C and then 5 minutes at 65' C. Ten picomoles of the oligonucleotide were mixed with 4 microliters of 
the template prepared as described above. These were brought to a final volume of 10 microliters in Hin 
buffer: 6.6 mM Tris (pH 7.4). 6.6 mM MgClg. 6.6 mM NaCI, 5 mM dithiothreitol. Tubes containing the 
samples were suspended in a beaker of water which was brought to a boil and allowed to cool to room 

40 temperature. This permitted the annealing of the oligonucleotide to the template. To the cooled mixture was 
added nine microliters of NTP mix: Hin buffer, 1 mM of each of the four deoxynucleotide triphosphates, and 
1 mM rATP. To these samples were added 3 units each of T4 DNA ligase and the Klenow fragment of DNA 
polymerase I of E. coli. Both of these enzymes were obtained from Boehringer Mannheim. The samples 
were then incubated at 15* C overnight. 

45 

Introduction of Plasmid into Cells and Cell Screening 



A culture of E. coli JM101 cells were made competent to take exogenous DNA. The cells were 
50 collected from a culture growing at 37" C in LB medium. They were then resuspended in 1/2 culture volume 
with 50 mM CaCb. After storage on ice for 30 minutes, the cells were collected by centrifugation, and the 
cell pellets were resuspended in 1/10 culture volume in 50 mM CaCb- After one half hour incubation at 
4' C. the samples were incubated at 42° C for one minute. One milliliter of L broth was added, and then the 
samples were incubated at 37* C for 2 hours. The cells were collected by centrifugation and spread^ on agar 
55 plates containing 200 mg/mL of ampicillin. Colonies which grew after overnight incubation at 37' C were 
picked into liquid broth also containing 200 mg/mL ampicillin. Plasmid DNA was isolated from the cells in 
these cultures by standard methods and subject to analysis by polyacrylamide gel electrophoresis of the 
DNA which had been treated with restriction endonucieases. Those plasmid DNAs which were found to 
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contain DNA restriction fragments whose size indicated the presence of the synthetic DNA in place of the 
parental DNA were chosen as candidates for the desired recombinant. The DNA of these plasmids was 
subjected to DNA sequence analysis by standard methods to confirm the presence of the desired sequence 
between the Ncol and PstI restriction sites. 

5 

Bacterial Fermentation 



E, coli strain W3110 II-4 harboring plasmid pMON2464 was used to produce a variant of alanyl-IGF1 
10 which contains histidine substitutions at positions 8 and 12, A transformant of W3110 11-4 which harbors the 
plasmid pMON2464 was used to start an overnight culture in L broth containing 200 mg/mL of ampiciilin. 
This was used to inoculate a fermentor culture. The growth hiedium contained the following: KOH, H3PO*, 
(NH4)2S04., MgSO^.. trace metals and Alimet Liquid dextrose was used as carbon source and the residual 
dextrose concentration in the fermentor was maintained between 0.05% and 0.25% using a concentrated 
75 dextrose feed strategy. No antibiotic was added to the fermentor. Fermentation run parameters were as 
follows: 37° C, 100 rpm agitation, air sparge rate of 10 liters per minute, 0.35 Kg/sq. cm. back pressure and 
pH set-point controlled at 7.0 with ammonium hydroxide. When the culture had grown to optical density 
(550 nm) of 20, the temperature was shifted from 37" C to 33° C and maintained for the duration of the run. 
When the culture reached an optical density (550 nm) of 42, nalidixic acid was added to a final 
20 concentration of 25 ppm to induce expression of the IGF1 variant gene from the rec A promoter on 
pMON2464. The cells were then harvested, frozen and stored at -80' C. 



Cell Lysis and Isolation of Inclusion Bodies 

25 

After thawing an appropriate amount of frozen cell paste at 5°C, 11 Og of the cell paste was carefully 
suspended in 480 mL cold water using an Ultra Turrax stirrer. The chilled cell suspension was passed 4 
times through a pre-cooled Manton Gaulin homogenizer set at 420-560 Kg/sq. cm. pressure. The resulting 
suspension of lysed cells was subjected to ultracentrifugation at 50,000 Xg (25,000 rpm in 45TI rotor) for 35 
30 minutes using a Beckman Model LB centrifuge. The clear supernatant liquid was poured off. and the 
remaining brownish pellet was vigorously washed with a small stream of water in order to remove the top 
slimy layer of unwanted cell debris. The pellet was resuspended in water and subjected a second time to 
ultracentrifugation and washing. The remaining material was mechanically scraped out of the centri-fuge 
tubes and combined to yield 2.6 g of damp inclu-sion bodies which were stored at -80 ' C for future use. 

35 

Oxidation and Folding of IGF1 

Method A. A 1.3 g mass of inclusion bodies was suspended in 80 mL cold buffer (6 M urea + 25 mM 

40 Tris, pH = 970) using an Ultra Turrax stirrer. Dithiothreitol (120 mg) was added to the mixture in order to 
reduce and solubilize the inclusion bodies. This mixture was stirred at 5-10' C for 10 minutes, and then 160 
mL of cold Tris buffer (25 mM, pH = 9.0) was added. The pH of this mixture was rapidly raised to 11,0 by 
dropwise addition of 2.5 M NaOH. The mixture was stirred for about 1 minute and then the pH was rapidly 
reduced to 9.5 by dropwise addition of 6 M HCl. This solution was vigorously stirred in an open container at 

45 5 "C for 16 hours in order to complete the folding process. In order to remove residual insolubles, the 
refolded mixture was subjected to ultracentrifugation at 50,000 Xg (25,000 rpm in Beckman 45tl rotor). The 
clear pale yellow superna-tant liquid was decanted off and then prepared for purification by adding 1.4 g 
NaCI and adjusting pH to 8.5. 

Method B. A 2.6 g mass of inclusion bodies was suspended in 100 mL cold buffer (6 M urea + 25 mM 

50 NaaBOs, pH^ 9.5) using an Ultra Turrax stirrer. Dithiothreitol (150 mg) was added to the mixture in order 
to reduce and solubize the inclusion bodies. This mixture was stirred at lO'C until most of the inclusion 
bodies had dissolved (-10 minutes), and then 1100 mL of cold borate buffer (25 mM, pH = 9.5) was 
added. The pH of the mixture was checked to make sure that it was 9.5. This solution was vigorously stirred 
in an open container at 5°C until the oxidation was complete (10-48 hours). Sodium chloride (7.0 g) was 

55 dissolved in the refold mixture followed by dropwise addition of glacial acetic acid until the pH was 4.5. The 
mixture was then subjected to ultracentrifugation at 50,000 Xg (25,000 rpm in Beckman 45TI rotor). The 
clear supernatant liquid was decanted off, the pH was adjusted to 8.5 using 2 M NaOH, and then the protein 
solution was stored at 4** G awaiting further purification. 

14 
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Preparation of Functionalized Resin 

Trisacryl GF2000M was repeatedly washed with distilled water to remove all buffers and preservatives 
and then dried by suction. About 100 g (-100 mL) of this slightly damp matrix was suspended in a solution 

5 containing 80 mL digiyme and 100 mL freshly prepared 1.4 M NaOH solution. ^Finally, 100 mL 
diethyleneglycol diglycidylether was added, and the mixture was gently stirred at 35* C for 16 hours. 
Purified diethyleneglycol diglycidylether was prepared by a literature method. (X. Gu, L Ikeda, M. Okahara, 
Synthesis, 649, [1985]) The activated matrix was washed with dlglyme/H2 0 (50/50) and then repeatedly 
washed with distilled water to remove the excess epoxide and base. The washed, suction-dried matrix was 

70 shown to have. 70 micromoles active epoxide groups per mL resin. The activated resin was stored at 4 C 
and generally used within 24 hours of preparation. 

Immobilization of Chelate 



The activated Triascryi was washed with distilled water and dried by suction. About 100 g (-100 mL) of 
this activated gel was suspended in 100 mL 1.0 M Na2NH(CH2C02)2 solution, which was adjusted to pH = 
10.5-11.0. This mixture was gently stirred at 65'C for 24 hours and then repeatedly washed with distilled 
water to remove excess ligand. The functionalized resin was stored in ethanol/water (25/75 v/v) at 4 until 

20 ready for use. Titration with thiosulfate showed the absence of epoxide groups, so capping with 
ethanolamine was deemed unnecessary. Ten milliliters of suction-dried gel was saturated with excess 50 
mM Cu(C104)2 and then carefully washed with 100 mL distilled water, 100 mL 50 mM imidazole (pH = 7.0) 
and finally 100 mL H2O. Then the bound copper was removed with an excess of 50 mM Na2H2EDTA (pH 
= 7.0). Using standardized copper-EDTA solutions for comparison, the total copper content was 

25 photometrically determined to be 0.43 millimoles. 



Elution Protocol for Metal Affinity Column 



30 A glass column (1,6 x 13 cm, 26 mL) was packed with carefully washed IDA-Trisacryl gel and then 
charged with 130 mL 50 mM Cu(CI04)2 (pH 4.5). The functionalized gel nearly quantitatively absorbed the 
copper ions. This copper column was washed with 100 mM NaCl. and then it was equilibrated with the 
release buffer (50 mM imidazole, 500 mM NaCl. 50 mM NaHsPO*, pH = 7.0) and finally equilibrated with 
the loading buffer (0.5 mM imidazole. 500 mM NaCl, 50 mM NaH2P04, pH = 7.0). The clarified crude 

35 refold mixture (-1200 mL) was adjusted to pH = 8.5 and then pumped onto the column at 4.0 mL'min; the 
column was then washed with 80 mL of the loading buffer. The column was developed for 500 minutes at a 
flow rate of 1.3 mUmin at ambient temperature (23*) using a linear gradient of imidazole (0.5 50 mM). 
The eluate from the column was continuously monitored at 280 nm (0.05-0.5 AUFS) using a Kratos Model 
757 spectrometer equipped with a 3 mm path-length cell. Fractions (13 mL) were collected using a Gilson 

40 Model 202 fraction collector. After the run was completed, the column was stripped with 50 mM 
Na2H2EDTA (pH 7.0) and then 50 mM NaOH. The column was ready for regeneration after washing with 
100 mM NaCl (240 mL). loading buffer (240 mL) and finally 100 mM NaCl (240 mL). Two strong binding 
protein peaks eluted off the copper column. The fractions containing the second peak were collected and 
combined to yield 80-100 mL solution which contained 30-40 mg protein. A typical analysis by analytical 

45 reverse phase HPLC (4.6 x 250 mm. Brownlee Cs Aquapore column, H2O/CH3CN + 0.1% CF3CO2H, 210 
nM) of A-iHsHi2-IGF1 at this stage gave the following results. 



Protein 


Analysis 


IGF1 


73% 


tGF1 monomeric isoform 


16% 


IGF1 -related oligomer 


10% 


Extraneous protein 


1% 



55 

In addition to our Trisacryl gel. Pharmacia Chelating Sepharose Fast Flow was used with acceptably good 
results. 
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Elution Protocol for Reverse Phase HPLC 



The purified somatomedin solution (-100 mLy-36 mg total protein) was concentrated to 1.5 mg/mL 
protein concentration using a 100 mL stirred Amicon ultrafiltration cell equipped with a YM2 membrane. Part 

5 of the concentrated sample (12 mL) was filtered (0.2um) prior to injection onto the reverse phase column, 
and the remainder of the sample was frozen for future use. The columr used was an Aquapore R-300 Cg 
reverse phase column (7.0 x 250 mm) distributed by Brownlee Labs. The column was equilibrated with a 
10/90 mixture of acetonitrile-water (0.1% CF3CO2H), and then the protein solution was injected onto it. The 
column was developed for 50 minutes at a flow rate of 3.0 mL/min. at ambient temperature using a linear 

70 acetonitrile gradient until the final solvent composition was 60/40 acetonitrile-water (0.1% CF3CO2H). The 
eluate from the columr was continuously monitored at 280 nm (0.2-2.0 AUFS) using a Kratos Model 757 
spectrometer equipped with an 8 mm cell. Fractions {0.9 mL) were collected using a Gilson Model 202 
fraction collector. Properly folded A-i HsHia-IGFI eluted off the column at 22 minutes ("32% MeCN), its 
companion isoform came off at 25 minutes ("35% MeCN) and several oligomer peaks came off in the range 

75 27-35 minutes. The relevant fractions containing the IGF1 were analyzed using analytical reverse phase 
HPLC. Eight fractions containing the main peak were pooled and shown to contain 13 mg at 98 + % purity. 
Three fractions containing the isoform were pooled and shown to contain 1.6 mg at 95% purity. The column 
was cleaned by washing with 90/10 MeCN-H20 and then reequilibrated with the starting buffer 10/90 
MeCN-H20 (0.1% CF3CO2H). The remaining sample was thawed and similarly purified using the same 

20 procedure. 



Lyophilization 

25 Requisite volumes (0.1-1.5 mL) of the purified IGF1 solutions were pipeted into 2 mL Eppendorf tubes 
and placed in a Savant Vacuum Concentrator (Speedvac, SVC 200H) in order to remove the solvents (H2O, 
CH3CN, CF3CO2H). The purified proteins were obtained as fluffy white solids and stored at -80° C. The total 
yield of purified A-i HsHi2-lGF1 was 26.4 mg; 3.2 mg of the companion isoform was also obtained. 

30 

S-Su!fonation Procedure 



About 1.5 mg A-i HsHi 2-IGF1 (misfolded or properly folded) was dissolved in 1.5 mL sulfonatton buffer 
which contained 125 mM Na2S03. 25 mM Na2S4 06 *2H20' 25 mM H3BO3 and 6 M urea (pH 8.5). The 
35 reaction was allowed to proceed for 3 hours at 25 'C or 12 hours at 5" C after which the reaction mixture 
was filtered (0.2u) and injected onto the reverse phase HPLC column (see above). Despite the formation of 
six additional negative charges, the protein was more hydrophobic and eluted from the column in 29 
minutes (-39% MeCN). The fractions containing the product were handled in the same manner as the 
native protein to yield 1.3 mg pure A-i H3Hi2-IGF1(S03)s. 



Modeling Metal Chelating Sites in Proteins 



The simultaneous interaction of two or more metal-binding sites of a single muitidentate ligand to a 
45 single metal or single cluster of rigidly attached metals is called metal chelation. It is known that properly 
designed chelates bind stronger to a particular metal than analogous non-chelating ligands. (A. E. Martell, R. 
M. Smith, Critical Stability Constants; Plenum Press: New York. 4 vols.[1 975]) Properly designed means (1) 
that the chemical nature of the donor atoms in the ligands are matched to the particular metal ion or metal 
complex; (2) that the two or more metal-binding atoms easily satisfy the specific geometric requirements of 
50 the metal; (3) that the chelating form of the ligand is conformationally constrained (rela-tively inflexible, 
rigid). 

Of the natural amino acids, only the side chains of cysteine, histidine, aspartate and glutamate have 
significant binding strength in aqueous solutions for divalent first row transition metals at neutral pH. (A. E. 
Martell, R. M. Smith, Critical Stability Constants; Plenum Press: New York, 4 vols. [1975]) 
55 cys > his » asp, glu > other amino acids 

For a cis disposition of ligands binding to Cu^* (similarly for VO^*, Ni^*, and Zn^*), X-ray crystallog- 
raphic data for metal complexes show that typical copper-nitrogen bonding parameters are Cu-N = 1.98- 
2.02 A and N-Cu-N = 80"-100\ (G. Nardin. L. Randaccio, R. P. Bonomo and E. Rizzarelli. J. Chem. Soc, 
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Dalton Trans., 369, [1980]. A. Podder. J. K. Dattagupta. N. N. Saha and W. Saenger. Acta Cryst, B35, 53 
[1979]). X-ray crystallographic data for proteins show three commonly observed secondary structural 
features: a-helixes. ;9-strands and turns. These structured regions at least partially fulfill the requirement of 
conformational constraint. Typical conformational values for a-helices {<t> - -57* . = -47 , w = 180 ), (S. 
Arnott. A. J. Wonacott. J. Mol. Biol.. 21, 371. [1966]). T, Blundell, D. Barlow, N. Borkatakoti, J. Thorn-ton, 
Nature, 306 , 281, [1983]) j9-strands (<>^ = -139', ^ = +135", co = 180'). (C. Chothis, J. Mol. Biol.. 75, 
295, [19731) and ;S-hairpin turns (Type T, Type ll') (B. L. Sibanda, J. M, Thornton, Nature, 316, 170. [1985]) 
were used. Geometric searches of energetically acceptable side-chain conformations (J. W. Ponder, F. M. 
Richards, J. Mol. Biol.. 193, 775, [1987]) for histidine and aspartate residues were carried out in order to 
find which amino acid sequences coupled with corresponding secondary structures could provide a 
bidentate chelating site for Cu^* with the above distance and angular con-straints. Only short range 
chelating interactions were considered; i.e., the number of intervening residues between the binding 
residues was 0 to 4. The results of the calculations are shown in the following table; ( + ) shows when 
chelation Is possible, and (-) shows when chelation cannot occur. The nature of the intervening residues 
("X") is relatively unimportant. The modeling showed that the steric size, the hydropathicity and the charge 
of the sidechains of these residues play only minor or secondary roles in deter-mlning the strength of the 
metal-chelating peptide interactions. 



Sequence 


a-Helix 


j3-Strand 


j6-Hairpin 
Turn 


HH 








HxH 






+ 


HxxH 






+ 


HxxxH 


+ 






HxxxxH 








DH 








DxH 








DxxH 






+ 


DxxxH 


+ 






DxxxxH 








HD 








HxD 








HxxD 






+ 


HxxxD 


+ 






HxxxxD 









Locating Secondary Structure 



In the absence of more reliable structural information, regions containing significant secondary structure 
were determined from amino acid sequence information using the prediction algo-rithms of Nagano (K, 
Nagano, J. Mol. Biol., 109 , 251. [1977]), Chou (P. Y. Chou. G. D. Fasman. Adv. EnzymoL. 47 , 45, [1978]). 
Garnier (J. Gamier. D.~j7 Osguthorpe. B. Robson, J. Mol. Biol., 120. 97, [1978]) and Wako (H. Wako. N. 
Waito, H- A. Scheraga, J. Protein Chem.. 2 , 221. [1983]); a fifth method using homologies to sequences in 
known structures was also used. These five pre-dictive methods were applied to each protein from a series 
of aligned, homologous proteins. Only when there was substantial agreement between all predictions was a 
joint prediction con-sidered to be reliable. The table below shows the joint pre-diction of proinsulin from 
eight mammalian species (human, pig. guinea pig, rat, mouse, horse, cow. dog) and the joint prediction of 
insulin-like growth factor-1 from four mammalian species (human, cow, rat. mouse). The composite 
predictions for proinsulin and IGF1 shows that no j3-structure is reliably predicted. However, two regions 
containing turns (19-23. 39-42) are moderately predicted, and one helical region (8-17) is strongly predicted. 
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55 Identifying Exposed Residues 

Once regions of regular secondary structure have been iden-tified, it is necessary to determine which 
residues in these regions are sufficiently exposed on the surface of the protein so that they couid easily 
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bind to metals. The periodicity of the hydropathy over the region of interest was used as a guide in finding 
the exposed residues. Although a number of hydropathy scales have been defined, the one most useful in 
the present application Is the scale based on the degree to which a particular amino acid residue is buried 
or exposed as determined in proteins whose structures are determined by X-ray crystallography. 

5 (A. Kidera. Y. Konishi, M. Oka. T. Ooi, H. A. Scheraga, J. Protein Chem., 2, 221, [1983]). 

a-Helix. For the entire helical region of interest, the hydropathic moment (direction and magnitude) was 
calculated using a pitch of 18 residues per 5 turns (100* /residue). If the hydro-pathic moment was 
sufficiently large (>|0.3| per residue), the residues were then classified into three equally populated cate- 
gories; exposed ( + ), buried (-) and borderline (0). 

10 The calulations for residues 8-17 In IGF1 and for the analogous region in proinsulin showed that residue 
12 (asp/glu) had the best exposure to the solvent. Similarly, residues 8, 15, and 16 (ala/ser, gin/tyr, phe/leu) 
were exposed, and residues 10, 14 and 17 (leu/leu, leu/leu. val/val) were buried. Residues 9. 11 and 13 
were calculated to lie in an intermediate region, neither fully exposed nor completely buried; of these three 
residues, E9/H9 is more exposed, and Ln/Ln and A13/A13 are more buried. With the exception of residue 

15 16, the helix has good amphlphilic character. On the basis of those calculations, H8Hi2-iGFl and H12H16- 
IGF1 were judged to contain the best metal binding sites. We over-looked several possible problems; (1) 
the proximity of residues 8 and 16 to the ends of the calculated helix, (2) the replacement of a hydrophobic 
residue (pheie) by a hydrophilic histidine, (3) the possibility that residue 8 was part of a G7A8 turn, 

;8-Strand. For the entire ^-strand region of interest, the hydropathic moment was calculated using a 

20 pitch of 2 residues per turn (180° /residue). This is an easy calculation for ^-structures because each 
residue is either '^up" or "down." In this case the residues were classified into two equally populated 
categories: exposed ( + ) and buried (-). 

j3-Hairpin Turn. Because these two-residue turns are super-secondary structures, there is little need for 
further calculationi once the residues in the turn have been identified. The residues suited for metal 

25 chelation are the two residues on either side of the hairpin turn. These turns occur most frequently on 
exposed surfaces of proteins with the turn residues and immediately adjacent residues exposed. 

Owing to the presence of neighboring cysteine, glycine or proline residues, neither predicted turn 
region (19-23. 39-42) in IGF1 indicated a clearly defined, single 2-residue turn, and no good metal binding 
sites were judged to be present. 

30 

Biological Assay 

The biological activity of the 1GF1 variants was assayed by measuring enhancement of myoblast 

35 proliferation in vitro . Rat L6 myogenic cells (D. Yaffe, Proc. Natl. Acad. Sci.. 61, 477. [1968]) were used in a 
cell proliferatTorTaisay. (0. E. Kotts, M. E. White. 0. E. Allen. F. Martin. W. R. Dayton. J, Animal Sci., 64, 
615, [1987]). Previous work has shown that native iGF1 will respond to this assay. (C. E. Kotts, G. A. Baiie, 
Fed. Proc, 44 , 484, [1985]). This assay was used with minor modifications which are briefly described. 
Cells were plated onto 2 cm^ wells (24-well plates. Corning) at 1000 cells/cm^ in Dulbecco's Minimum 

40 Essential Medium (DMEM, Dibco Laboratories, Grand Island, N.Y.) containing 10% fetal bovine serum (FBS. 
Gibco). After 24 hrs, the test medium containing the variant IGF1 (0.1-50 nM) in DMEM plus 2% FBS was 
applied (1 mL/well). Stock solutions of fragments were prepared in 10 mM HCI at a concentration of 10 
m.g/mL Fresh test medium was applied again 24 hours later. After an additional 48 hr, cell number was 
estimated by measuring the content of DNA on each well. DNA content was correlated with cell number 

45 using a standard curve consisting of known numbers of L6 cells as counted on a Coulter counter (Model 
2M, Coulter Electronics, Hiieah, Florida). Controls received DMEM containing 2% FBS and appropriate 
volumes of 10 mM HCI. Positive controls received various concentrations (0.1-50 nM) of recombinant 
human/bovine 1GF1 (Monsanto Co. Lot S105, St. Louis, Mo.) in DMEM plus 2% FBS. All incubations were 
carried out at 37* C, 10% CO2 and 100% humidity. Results are presented as percent increase in cell 

50 number over controls (DMEM + 2% FBS) within each assay and defined as stimulation. Intra-assay 
variation averaged 5,1% (±1.2%) and inter-assay variation was 22.2% among the experiments used in this 
study. 

Single point assays at 0.1. 1 and 10 nM concentration of IGF1 showed the following activities: 

55 
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A complete kinetic analysis of the concentration dependent rates of proliferation for two of tlie above 
proteins gave the following results: 



15 



Protein 




V max 


Native IGF1 
A-1H8H12 IGF1 


2.9 nM 
1.5 nM 


168% 

138% 



20 



The values of effective are the same within experimental error (± factor of 2). The value of V^ax for 
the IGF1 variant shows good maximal activity, although it is somewhat lower than the native protein (±20% 
error). 
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Proteins produced by recombinant DNA technology in bacteria are commonly accumulated in insoluble 
refractile bodies in the cytoplasm of the host cell. To recover the proteins In their active form, renaturation is 
required. Renaturation entails solubilizing, folding and sometimes oxidizing the protein to its native 
configuration. As a result of such renaturation process, various oligomers, isoforms and misfolded mon- 
omers, as well as a variety of bacterial proteins, are found in the crude refold mixture. This example 
illustrates a method of purifying desired proteins directly from a crude refold mixture in one metal-affinity 
purification step. 

Because IGF1 has no endogenous histidine residues, two changes in the native protein were made and 
purified according to the procedure set forth in Example 2. The elution profile is shown in Figure 1. In 
addition to IGF1 and a variety of bacterial proteins in the crude refold mixture, there is one major misfolded 
IGF1 monomer, IGF-related oligomers and at least one major IGF isoform. Upon purification of the crude 
refold mixture on a metal -affinity column, virtually all of the bacterial proteins were removed and most of the 
IGF oligomers were removed (residual oligomer content s 4%-8%). Ordinarily it is difficult to separate the 
properly folded IGF1 from the misfolded IGF1; however, as a bonus, the metal clearly recognized the two 
differently folded forms and cleanly separated them. As illustrated in Fig. 2, the properly folded 1GF1 binds 
strongest to the column. The misfolded IGF1 also binds reasonably well to the metal column, but the 
misfolding apparently distorts the helix which reduces its binding. The properly folded IGF1 isoform is 
easily separated by the metal column from the misfolded isomer but coelutes with the properly folded IGF1 
itself. A second purification step (reverse phase HPLC or size exclusion chromatography) easily removed 
the residua! IGF1 oligomer and also removed (reverse phase HPLC) the isoform. 



Claims 

1. A variant of a natural protein having a metal-binding amino acid residue at each of one or more positions 
which, in the natural protein, are occupied by other amino acid residues, such that the variant protein 
contains a metai-chelating amino acid sequence, said metal-binding amino acid residue being selected from 
residues of histidine and aspartic acid. 

2. A variant protein of Claim 1 wherein the metal-cheiating amino acid sequence is represented by the 
formula -A-Bx-Cy-Dz-E wherein A and E are Independently histidine or aspartic acid. B. C and D are amino 
acids and x, y, and z are independently integers from 0 to 3, provided that x + y + z In combination with 
the secondary structure of the segment of the protein containing the metal-chelating sequence affords a 
stereochemical arrangement of A and E adapted to form a chelate with an immobilized metal. 

3. A variant protein of Claim 2 wherein A and E are both histidine and x + y + z is equal to 3, said 
sequence being both engineered into said protein on an a- helical segment thereof. 

4. A protein of Claim 3 which is a variant of somatotropin. 

5. A variant somatotropin of Claim 4 having a histidine residue at one of positions 15 and 173. 

6. A variant somatotropin of Claim 5 which is a variant of bovine somatotropin. 

7. A variant somatotropin of Claim 5 which is a variant of porcine somatotropin. 

8. A protein of Claim 3 which is a variant of somatomedin C. 

9. A variant somatomedin of Claim 8 having histidine residues in both positions 8 and 12 where the natural 
somatomedin has Alag and Aspis- 

10. A variant somatomedin of Claim 8 having histidine residues at both positions 12 and 16 where the 
natural somatomedin has Aspi2 and Pheie- 

11. A DNA sequence coding for a variant somatotropin and having a codon for histidine in place of a codon 
in the DNA sequence which codes for native somatotropin, said codon in the DNA sequence which codes 
for native somatotropin being selected from the codons for Leu 15 and Thrwa- 

12. A DNA sequence of Claim 11 which codes for a variant bovine somatotropin. 

13. A DNA sequence of Claim 11 which codes for a variant porcine somatotropin. 

14. A DNA sequence coding for a variant somatotropin and having codons for histidine In place of the 
codons which, in the DNA sequence which codes for native somatomedin, code for Alas and Aspi2 or for 
Aspi2 and Pheis. 

15. In a method for fractionating proteins or polypeptides utilizing immobitized-metal affinity chromatog- 
raphy, the improvement which comprises enhancing the affinity of the protein or polypeptide for the 
immobilized metal by engineering within said protein or polypeptide, prior to fractionation, a metal-chelating 
amino acid sequence. 

16. A method for fractionating proteins or polypeptides comprising a variant protein as defined in any of 
Claims 1 to 9, which comprises fractionating the proteins or polypeptides utilizing immobillzed-metal affinity 
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chromatography. 

17. A method for increasing milk production in a female mammal which comprises administering to said 
mammal an effective amount of a variant mammalian somatotropin having a histidine residue where the 
native somatotropin has Leuis. 
5 18. Method of Claim 17 wherein the variant somatotropin is a variant bovine somatotropin and the mammal 
is a dairy cow. 
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QP]KEISGTV KLLGSKSLSN 
xPIKDISGTV KLPGSKSLSN 
QPIREISGLI KLPGSKSLSN 
EPIKDFSGTI TLPGSKSLSN 
QPIKEISGTV KLPGSKSLSN 
QPIREISGLI KLPGSxxxxx 
QPIARVDGTI MPGSKTVSN 
QPIARVDGAI N-LPGSKSVSN 
PAARLARGEV ALPGSKSISN 
G LPGSK--SN 



50 

RILLLA.aSE GTTWDNLLS 
RILLL^aSE GRTVVDNLLS 
RILLLAALSE GTTWDNLLN 
RILLLAALSE GTT\'VDNLLY 
RILLLAALSE GTTWDNLLN 
RILLLAALSE GTTWDNLLN 
RALLLAALAH GKTVLTKLLD 
RALLLAALAC GKTALTNLLD 
RVLLLAALAE GSTEITGLLD 
R-LLLAAL-- G-T LL- 



51 

Petunia. SDDIHmCA LKTLGLHV'EE 

Tomato. SDDIHYMLGA LKTLGLHVED 

Arabidopsis. SDOINYMLDA LKRLGLNVET 

-Soybean. SEDIHYMLGA LRTLGLRVED 

Maize. SEDYHYMLGA LRTLGLSVEA 

B.napus SODINYMLDA LKKLGLNVER 

E.coli SDDVRHMLNA LTALGVSYTL 

Salmonella. SDDVRHMLNA LSALGINYTL 

Bordetella. SDDTRVMLAA LRQLGVSVGE 

Consensus S-D ML-A L--LG 
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DSANQRAWE GCGGLFPVGK ESKEEIQLFL 
DN-ENQRAIVE GCGGQFPVGK KSEEEIQLFL 
DSENNUAWE GCGGIFPASI DSKSDIELYL 
DKTTKQAIVE GCGGLFPTSK ESKDEINLFL 
DKAAKRAWV GCGGKFPVE. DAKEEVQLFL 
DSVNNRAWE GCGGIFPASL DSKSDIELYL 

SADRTRCEII G NGGPL HAEGALELFL 

SADRTRCDIT G NGGAL RAPGALELFL 

VAD GCVTIEGVAR FPTEQAELFL 



101 

Petunia. GNAGTAMRPL TAAVTVAGGN 

Tomato. GNAGTA.MRPL TAAVTVAGGH 

Arabidopsis. G.NAGTAMRPL TAAVTA.\GGN 

Soybean. GNAGTAMRPL TAAWAAGGN 

Maize. GNAGTAMRPL TAAVTA.\GGN 

B.napus GNAGT-LMRPL TAAVTA.\GGN 

E.coli G.NAGTAMRPL AULCLGSND 

Salmonella. GNAGTAMRPL AAALCLGQNE 

Bordetella. GNAGTAFRPL TAAL.'VLMGGD 

Consensus GNAGTA-RPL -AA 
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SRYVLDGVPR MRERPISDLV DGLKQLGAEV 
SRYVLDG\TR MRERPIGDLV DGLKQLGAEV 
ARYVLDGVTR MRERPIGDLV VGLKQLGADV 
ASYVLDGVPR MRERPIGDLV AGLKQLGADV 
ATYVLDGVPR MRERPIGDLV VGLKQLGADV 
ARYVLDG\TR MRERPIGDLV VGLKQLGADV 
IV..LTGEPR MKERPIGHLV DALRLGGAKI 
IV..LTGEPA MLERPIGHLV DSLRQGGANI 
..YULSGVPR MHERPIGDLV D,\LRQFGAGI 
L-G-P- M-ERPI--LV — L— GA— 
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Petunia . 


DCFLGTKCPP 


Tomato . 


DCSLGTNCPP 


Arabidopsis . 


ECTLGTNCPP 


Soybean . 


DCFLGTiNCPP 


Maize . 


DCFLGTDCPP 


B. napus 


ECTLGTNCPP 


E.coli 


TYLEQENYPP 


Salmonella. 


DYLEQENYPP 


Bordetella. 


EYLGQAGYPP 


Consensus 


pp 
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Petunia. 


GDYEIEIIDK 


Tomato . 


GDVEIEIIDK 


Arabidopsis . 


GDYEIEIVDK 


Soybean. 


GDVEIEIVDK 


Maize . 


GDVEIEIIDK 


B . napus 


GDVEIEIIDK 


E. coll 


EDTVIRIKGD 


Salmonella. 


KDTIIRVKGE 


Bordetella. 


QDITIEWGE 


Consensus 


-D— I 



VRIVSKGGLP GGKVKLSGSI 
YRIVSKGGLP GGKVKLSGSI 
VRY^'ANGGLP GGKVKLSGSI 
VRVNGKGGLP GGKVKLSGSV 
VRVNGIGGLP GGKVKLSGSI 
VRV^'A-NGGLP GGKVXLSGSI 
LRLQGGF..T GGNVDVDGSV 
LRLRGGF..T GGDIEVDGSV 
LRIGGGSIRV DGPVRVEGSV 
_R Q GS- 



LISVPYVEMT LKLMERFGIS 
LISVPYVEMT LKLMERFGVF 
LISVPYVEMT LKLMERFGVS 
LISVPYVEMT LKLMERFGVS 
LISIPYVEMT LRLMERFGVK 
LISVPYVEMT LKLMERFGVS 
LVSKPY-IDIT LNLMKTFGVE 
LVSKPYIDIT LNLMKTFGVE 
LISKPYIEIT LNLMARFGVS 
L-S-PY— T L-LM— FGV- 

FIGURE 1(A) 
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SSQYLT.VLLM AAPL. . . .AL 

SSQYLTALLM AAPL AL 

SSQYLT.^LLM SAPL. . . .AL 
SSQYLTALLM AAPL. . . .AL 

SSQYLSALLM AAPL AL 

SSQYLTALLM AAPL AL 

SSQFLTALLM TAPL. . . .AP 

SSQFLT.aLM TAPL AP 

SSQFLTALLM AAPVLARRSG 
SSQ-L-.aLM -AP 
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VEHSSSWDRF FVRGGQKYKS 
VEHSSGWDRF LVKGGQKYKS 
VEHSDSWDRF FVKGGQKYKS 
VEHSGN^VDRF LVHGGQKYKS 
AEHSDSWDRF YIKGGQKYKS 
AEHSDS^DRF FVKGGQKYKS 
lENQHYQQ.F WKGGQSYQS 
lANHHYQQ.F WKGGQQYBS 
VRRDGm.F TIARDAVYRG 
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Petunia . 
Tomato . 
Arabidopsis . 
Soybean . 
Maize . 
B . napus 
E. coli 
Salmonella. 
Bordetella. 
Consensus 



Petunia. 
Tomato . 
Arabidopsis . 
Soybean . 
Maize . 
B. napus 
E.coli 
Salmonella. 
Bordetella. 
Consensus 
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PRKAFVEGDA 
PGKAFVEGDA 
PGNAYVEGDA 
PGNAFVEGDA 
PKNAYVEGDA 
PGNAYVEGDA 
PGTYLVEGDA 
PGRYLVTGDA 
PGRMAIEGDA 
P VEGDA 
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GAEYTWTENS 
GAEVTWTENS 
GCKVSWTENS 
GAKVTWSENS 
GAKVTWTETS 
xxxxxxxxxx 
GATICWGDDY 
GATITWGDDF 
GADVRYGPGW 
G 



SSASYFLAGA 
SSASYFLAGA 
SSACYFLAGA 
SSASYLLAGA 
SSATYFLAGA 
SSASYFLAGA 
SSASYFLAAA 
SSASYFLAAG 
STASYTLALG 
S-A-Y-LA-- 



AVTGGTITVE 
AVTGGTVTVE 
AITGETVTVE 
AITGGTITVN 
AITGGTVTVE 
AITGETVTVE 
AIKGGTVKVT 
AIKGGTVKVT 
AIGGGPVRVT 
A--G V- 



GCGTNSLQGD 
GCGTSSLQGD 
GCGTTSLQGD 
GCGTSSLQGD 
GCGTTSLQGD 
GCGTTSLQxx 
GIGRNSMQGD 
GIGRKSMQGD 
GVGEDSIQGD 
G-G--S-QGD 



VT\T(GPPRSS 
VTVKGPPRNS 
YTVTGPPRDA 
VTYSGPPRDF 
YTVTGPPRSH 
xxxxxxxxxx 

I s 

I :. . .A 

lETRGVRYAE 



SGRKHLRAID 
SGyXHLRAID 
FGMRHLRAID 
SGRKYLRGID 
FGRKHLKAID 
xxxxxxxxxx 
CTRGELNAID 
CTRGELHAID 
GGR. .LKAFD 
L-— D 



YNMNKMPDVA 
YNMNKMPDYA 
YNMNKMPDVA 
YNMNKMPDVA 
Y>'MM(MPDVA 
xxxxxxxxxx 
ifDMNHIPDAA 
MDMNHIPDAA 
ADFNLIPDAA 
PD-A 
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N'KFAEVXEKM 
NKFAEVLEKM 
\T(FAEVLEKM 
NKFAEYLEKM 
YKFAEVLEMM 
xxxxxxxxxx 
IRFADVLEKM 
IRFADVLELM 
VAFAATLAAli 
--FA--L--M 

350 

MTLAYVALYA 
MTLAWALFA 
UTLAYVALFA 
MTLAWALFA 
MTLAYVALFA 
xxxxxxxxxx 
MTIATAALFA 
MTIATTALFA 
MTAATLALYA 
MT-A--AL-A 
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Petunia. DGPTAIRDVA S^'RN'KETERM 

Tomato. DGPTTIRDVA SMMtETERM 

Arabidopsis. DGPTTIRDVA S'ATIN-KETERM 

Soybean. NGPTAIRDVA SVk'R\'KETERy 

Maize. DGPTAIRDYA S^HYTCETERM 

B. napus xxxxxxxxxx xxxxKETERM 

E.coli KGTTRLRNIY N^TlNTtETDRL 

Salmonella. KGTTTLRNIY NU-R^KETDRL 

Bordetella. DGPCRLRNIG SVrTlVKETDRI 

Consensus -G R--- -MrUVKET-R- 
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lAlCTELRKL GATN'EEGPDY CIITPPEKLN 
lAICTELRKL GATWEGSDY CIITPPEKLN 
lAlCTELRKL GaT\'EEGSDY CYITPPKK. . 
lAICTELRKL GATVEEGPDY CVITPPEKLN 
YAIRTELTKL GASVEEGPDY CIITPPEKLN 
lAICTELRKL GAT^'EEGSDY CVITPPAK. . 
F.^LATELRKV GAE^'EEGHDY IRITPPEKLN 
FA.MATELRKV GAENTEGHDY IRITPPAKLQ 
H.AMHTELEKL GAGVQSGADW LEVAPPEPGG 
-A--TEL-K- GA-V--G-D PP 



401 450 
Petunia. V..TDIDTYD DHRMAMAFSL A.J^CADVPYTI NDPSCTRKTF PNYFDVLQQY 
Tomato. " V. .TEIDTYD DHRM-OiAFSL A.\CADVPVTI KNPGCTRKTF PDYFEVLQKY 
Arabidopsis. VXTAEIDTYD DHPJiAitAFSL A-kCADVPITI NDSGCTRKTF PDYFQVLERI 
Soybean. V..TAIDTYD DHRiLUtAFSL A-\CGDVPVTI KDP.CTRKTF PDYFEVLERL 
Maize. V..TAIDTYD DHPJiLiLMAFSL MCAE^TVTI RDPGCTRKTF PDYFDVLSTF 
B. napus VKP.^IDTYD DRRMAMAPSL A.\CADVPYTI KDxxxxxxxx xxxxxxxxxx 
E.coli F..AEIATYN DHRMAMCFSL YALSDTPYTI LDPKCTAKTF PDYFEQLARI 
Salmonella. . .B-iiDIGTYN DHRM.\MCFSL V.^O^SOTPVTI LDPKCTAKTF PDYFEQLARM 
Bordetella. WRDAfllGTWD DHRMAMCFLL A.\FGPA.\YRI LDPGCVSKTF PDYFDVYAGL 
Consensus I-T— DHRMAM-F-L -A Y-I C— KTF P-YF 
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Petunia. SKB. 

Tomato . SKH . 

Arabidopsis. TKH. 

Soybean. TKB. FIGURE KB) 

Maize. VKN. 

B. napus xxx. 

E.coli SqAA 

Salmonella. STPA 

Bordetella. LAA. 

Consensus — '■ — 
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Conttruclion »nd Structuf* ol !h» dU.rm.d pTiT37.CO T-ONA 




635S-EPSPS-7S 

iColnUgnUon with 
pMOH915 V»cter 
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0 5 10 75 20 25 

26 mL TRISACRYL-IDA-Cu COLUMN 
-5 mg CRUDE HsHig-IGFl 

0.02 AUFS, 280nm, 8mm CELL 

500 mM NoCl 

50 mM NgH2P04,pH = 7.0 

1/2-40mM IMIDAZOLE 

FIG. 1 
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A = INJECTION PEAK 

8 = NATIVE SOMATOMEDIN C. 

C= UNFOLDED VARIANT (SULFUR-CAPPED-HEXATHIOSULFONATE DERIVATIVE) 

D=MISFOLDED SOMATOMEDIN C VARIANT 

E'- PROPERLY FOLDED SOMATOMEDIN C VARIANT. 

NOT SHOWN IS THE UNFOLDED NATIVE IGFI HEXATHIO SULFONATE 

WHICH doesn't BIND TO THE METAL COLUMN. 



FIG. 2 
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